Multivariate Geographic Clustering Using aBeowulf - style Parallel

نویسنده

  • William W. Hargrove
چکیده

The authors present an application of multivariate non-hierarchical statistical clustering to geographic environmental data from the 48 conterminous United States in order to produce maps of regions of ecological similarity called ecore-gions. Nine input variables thought to aaect the growth of vegetation are clustered at a resolution of one square kilometer. These data represent over 7.8 million map cells in a 9-dimensional data space. For the analysis, the authors built a 126-node heterogeneous cluster|aptly named the Stone SouperComputer|out of surplus PCs. The authors developed a parallel iterative statistical clustering algorithm which uses the MPI message passing routines , employs a classical master/slave single program multiple data (SPMD) organization, performs dynamic load balancing, and provides fault tolerance. In addition to being run on the Stone Souper-Computer, the parallel algorithm was tested on other parallel platforms without code modiication. Finally, the results of the geographic clustering are presented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multivariate Geographic Cluster Using a Beowulf-style Parallel Computer

The authors present an application of multivariate non-hierarchical statistical clustering to geographic environmental data from the 48 conterminous United States in order to produce maps of regions of ecological similarity called ecore-gions. Nine input variables thought to aflect the growth of vegetation are clustered at a resolution of one square kilometer. These data represent over 7.8 mill...

متن کامل

Multivariate Spatio-Temporal Clustering of Time-Series Data: An Approach for Diagnosing Cloud Properties and Understanding ARM Site Representativeness

A multivariate statistical clustering technique— based on the iterative k -means algorithm of Hartigan (Hartigan, 1975)—has been used to extract patterns of climatological significance from 200 years of general circulation model (GCM) output. Originally developed and implemented on a Beowulf-style parallel computer constructed by Hoffman and Hargrove from surplus commodity desktop PCs (Hargrove...

متن کامل

9.4 Using Clustered Climate Regimes for Understanding Water Cycle Variability

A multivariate statistical clustering technique— based on the iterative k -means algorithm of Hartigan (Hartigan, 1975)—has been used to extract patterns of climatological significance from 200 years of general circulation model (GCM) output. Originally developed and implemented on a Beowulf-style parallel computer constructed by Hoffman and Hargrove from surplus commodity desktop PCs (Hargrove...

متن کامل

Multivariate Spatio-Temporal Clustering of Times-Series Data: An Approach for Diagnosing Cloud Properties and Understanding ARM Site Representativeness

A multivariate statistical clustering technique—based on the iterative k-means algorithm of Hartigan (Hartigan 1975)—has been used to extract patterns of climatological significance from 200 years of general circulation model (GCM) output. Originally developed and implemented on a Beowulf-style parallel computer constructed by Hoffman and Hargrove from surplus commodity desktop PCs (Hargrove et...

متن کامل

Multivariate Spatio-Temporal Clustering (MSTC) as a Data Mining Tool for Environmental Applications

The authors have applied multivariate cluster analysis to a variety of environmental science domains, including ecological regionalization; environmental monitoring network design; analysis of satellite-, airborne-, and ground-based remote sensing, and climate model-model and model-measurement intercomparison. The clustering methodology employs a k-means statistical clustering algorithm that ha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011